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1 CCTGCCTGGT 
51 AGGAGCCCAT 
101 CCTGGCCGTG 
151 GGCGCCAACG 
201 GGGCTGGGCA 
251 CGACCAGTTG 
301 GGACGCCGGT 
351 GTGACCCACG 
401 GATCCTGGGT 
451 GTCTCTTGGA 
501 CGCCGCTTCG 
551 TCAGGAGGGA 
601 CTGTCCCCGT 
651 TTCCAAAAGG 
701 GATGACCTGG 
751 TGGCAGAGAT 
801 GAGAACCTGC 
851 CACCTCGACC 
901 ATGTGCAGCG 
951 CGGCGACCAG 
1001 GATTCATGAG 
1051 ATATGACATC 
1101 ACGACACTCA 
1151 GGAGAAGCCC 
1201 ACTTTGTGAA 
1251 TGCCTCGGGG 
1301 CCTGCTGCAG 
1351 GCCACCATGG 
1401 TGTGCTGTGC 
1451 GCCAGAGGCT 
1501 AAAAAAAAAA 
(SEQ ID NO: 1) 



CCTCTGTGCC 
TTGGTAGTGA 
ATAGTGGCCA 
CTGGGCTGCA 
ACCTGCTGCA 
CGGCGCCGCT 
GGTCGTGCTC 
GCGAGGACAC 
TTTGGGCCGC 
CAAAGCCGTG 
AGTACGACGA 
CTGAAGGAGG 
CCTCCTGCAT 
CTTTCCTGAC 
GACCCAGCCC 
GGAGAAGGCC 
GCATAGTGGT 
ACGCTGGCCT 
CCGTGTCCAA 
AGATGGGTGA 
GTGCAGCGCT 
CCGTGACATC 
TCACCAACCT 
TTCCGCTTCC 
GCCGGAGGCC 
AGCCCCTGGC 
CACTTCAGCT 
TGTCTTTGCT 
CCCGCTAGAA 
CTAATGTACA 
AAAAAAAAAA 



TGGTGGGGTG 
GGCAGGTATG 
TCTTCCTGCT 
CGCTACTCAC 
TGTGGACTTC 
TCGGGGACGT 
AATGGGCTGG 
CGCCGACCGC 
GTTCCCAAGG 
AGCAACGTGA 
CCCTCGCTTC 
AGTCGGGCTT 
ATCCCAGCGC 
CCAGCTGGAT 
AGCCCCCCCG 
AAGGGGAACC 
GGCTGACCTG 
GGGGCCTCCT 
CAGGAGATCG 
CCAGGCTCAC 
TTGGGGACAT 
GAAGTACAGG 
GTCATCGGTG 
ACCCCGAACA 
TTCCTGCCTT 
CCGCATGGAG 
TCTCGGTGCC 
TTCCTGGTGA 
TGGGGTACCT 
ATAAAGCAAT 
AAAAAAAAAA 



GGGGTGCCAG 
GGGCTAGAAG 
CCTGGTGGAC 
CAGGCCCCCT 
CAGAACACAC 
GTTCAGCCTG 
CGGCCGTGCG 
CCGCCTGTGC 
ACGCCCCTTT 
TCGCCTCCCT 
CTCAGGCTGC 
TCTGCGCGAG 
TGGCTGGCAA 
GAGCTGCTAA 
AGACCTGACT 
CTGAGAGCAG 
TTCTCTGCCG 
GCTCATGATC 
ACGACGTGAT 
ATGCCCTACA 
CGTCCCCCTG 
GCTTCCGCAT 
CTGAAGGATG 
CTTCCTGGAT 
TCTCAGCAGG 
CTCTTCCTCT 
CACTGGACAG 
CCCCATCCCC 
AGTCCCCAGC 
GTGGTAGTTC 
AAAAAAA 



GTGTGTCCAG 
CACTGGTGCC 
CTGATGCACC 
GCCACTGCCC 
CATACTGCTT 
CAGCTGGCCT 
CGAGGCGCTG 
CCATCACCCA 
CGCCCCAACG 
CACCTGCGGG 
TGGACCTAGC 
GTGCTGAATG 
GGTCCTACGC 
CTGAGCACAG 
GAGGCCTTCC 
CTTCAATGAT 
GGATGGTGAC 
CTACATCCGG 
AGGGCAGGTG 
CCACTGCCGT 
GGTGTGACCC 
CCCTAAGGGA 
AGGCCGTCTG 
GCCCAGGGCC 
CCGCCGTGCA 
TCTTCACCTC 
CCCCGGCCCA 
CTATGAGCTT 
CTGCTCCCTA 
CAAAAAAAAA 



FEATURES: 

5'UTR: 

Start Codon: 
Stop Codon: 
3'UTR: 



1-77 
78 

1416 
1419 



Homologous proteins: 

Top 10 BLAST Hits 



CRAt 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 



18000004889269 
18000004927597 
18000004923926 
18000005007118 
18000005100319 
18000004884804 
18000004889271 
18000004884803 
18000004939934 
18000005107537 



/altid= 
/altid= 
/altid= 
/altid= 
/altid= 
/altid= 
/altid= 
/altid= 
/altid= 
/altid= 



^gi|181304 /def=gb I AAA53500 . 1 | (M33388. 
^gi|4503223 /def =ref j NP_000097 . 1 | cyto. 
^gi|181306 /def=gb | AAA35737 . 1 | (M33189. 
gi I 2493367 /def =sp I Q29488 I CPDH_MACFA . 
gi I 3913340 /def =sp I 018992 | CPDJ_CALJA . 
^gi|486997 /def=pir I | S37284 cytochrome. 
gi|522195 /def=gb | AAA36403 . 1 ! (M24499. 
^gi|461826 /def=sp | Q01361 | CPDE_BOVIN C. 
^gi 1 117244 /def=sp | P13108 I CPD4_RAT CYT . 
gi I 2575863 /def =dbj I BAA23125 . 1 1 (ABOO. 



Score 






884 


0 


0 


883 


0 


0 


864 


0 


0 


827 


0 


0 


800 


0 


0 


682 


0 


0 


673 


0 


0 


669 


0 


0 


665 


0 


0 


665 


0 


0 



EST: 

Sequences producing significant alignments: 
gi I 9872134 /dataset=dbest /taxon=960 . . . 
gi 1 6144331 /dataset=dbest /taxon=9606 ... 
gi I 6703894 /dataset=dbest /taxon=9606 ... 



Score E 
(bits) Value 

775 0.0 

648 0.0 

648 0.0 



EXPRESSION INFORMATION FOR MODULATORY USE: 
gi I 9872134 /liver 
gi| 6144331 /kidney 
gi I 6703894 /lung 



Tissue Expression: 
Whole Liver 



FIGURE 1 



} 



1 MGLEALVPLA VIVAIFLLLV DLMHRRQRWA ARYSPGPLPL PGLGNLLHVD 

51 FQNTPYCFDQ LRRRFGDVFS LQLAWTPVVV LNGLAAVREA LVTHGEDTAD 

101 RPPVPITQIL GFGPRSQGRP FRPNGLLDKA VSNVIASLTC GRRFEYDDPR 

151 FLRLLDLAQE GLKEESGFLR EVLNAVPVLL HIPALAGKVL RFQKAFLTQL 

201 DELLTEHRMT WDPAQPPRDL TEAFLAEMEK AKGNPESSFN DENLRIVVAD 

251 LFSAGMVTTS TTLAWGLLLM ILHPDVQRRV QQEIDDVIGQ VRRPEMGDQA 

301 HMPYTTAVIH EVQRFGDIVP LGVTHMTSRD lEVQGFRIPK GTTLITNLSS 

351 VLKDEAVWEK PFRFHPEHFL DAQGHFVKPE AFLPFSAGRR ACLGEPLARM 

401 ELFLFFTSLL QHFSFSVPTG QPRPSHHGVF AFLVTPSPYE LCAVPR 
(SEQ ID NO: 2) 



FEIATURES : 

Functional domains and key regions: 

[1] PDOCOOOOl PSOOOOl ASN_GLYCOSYLATION 
N-glycosylation site 



347-350 NLSS 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



327-329 TSR 



[3] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 

Number of matches: 5 

1 93-96 THGE 

2 198-201 TQLD 

3 238-241 SEND 

4 327-330 TSRD 

5 437-440 SPYE 



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 2 

1 233-238 GNPESS 

2 255-260 GMVTTS 



[5] PDOC00009 PS00009 AMIDATION 
Amidation site 

Number of matches: 2 

1 140-143 CGRR 

2 387-390 AGRR 



[6] PDOC00081 PS00086 CYTOCHROME_P450 

Cytochrome P450 cysteine heme-iron ligand signature 



385-394 FSAGRRACLG 



Membrane spanning structure and domains: 



Helix 


Begin 


End 


Score 


Certainity 


1 


3 


23 


1.877 


Certain 


2 


68 


88 


1.096 


Certain 


3 


171 


191 


0.668 


Putative 


4 


252 


272 


1.914 


Certain 


5 


400 


420 


1.402 


Certain 


6 


425 


445 


0.833 


Putative 
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BLAST Alignment to Top Hit: 

>CRA1 18000004889269 /altid=gi 1 181304 /def=gb ! AAA53500 . 1 1 (M33388) 
cytochrome P450 IID6 [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /length=49'? 
Length =4 97 



Score = 884 bits (2259), Expect = 0.0 

Identities = 444/497 (89%), Positives = 445/497 (89%), Gaps - 51/497 (10%) 

MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYSPGPLPLPGLGNLLHVDFQNTPYCFDQ 60 
MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARY PGPLPLPGLGNLLHVDFQNTPYCFDQ 
MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQ 60 

LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQG — 118 
LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQG 



-RPFRPNGLLDK 129 
RPFRPNGLLDK 



AVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLLHIPALAGKV 



LRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNDENLRIVVA 



DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVI 



HEVQRFGDIVPLGVTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWEKPFRFHPEHF 



LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPTGQPRPSHHGV 



Query : 


1 

X 


Sb j ct : 


1 


Query : 


C 1 
Ol 


Sb j ct : 


C 1 


Query : 


1 1 Q 

i 1 y 


Sb j ct : 


1/1 L 


Query : 




oDj Ct . 


1 Q 1 
1 O 1 


Query : 


1 y u 


Sbjct : 


241 


Query: 


250 


Sbjct: 


301 


Query : 


310 


Sbjct: 


361 


Query: 


370 


Sbjct: 


421 


Query : 


430 


Sbjct: 


481 



FAFLV+PSPYELCAVPR 



Hmmer search results (Pfam) : 

Scores for sequence family classification (score includes all domains 

Score 



Model Description 

PF00067 Cytochrome P450 

Parsed for domains: 

Model Domain seq-f seq-t 



PF00067 
PF00067 



1/2 
2/2 



35 
117 



113 
443 



hmm- f hmm- t 



1 

150 



92 [. 
497 .] 



516.7 



E-value N 
1.7e-151 2 



score E-value 

78.1 2.7e-21 
442.7 3.3e-129 
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1 AGCCTTACAA AGTGCTGGGA TTACCTGCGT GAGCCACCGG GTCCGGCCTC 
51 TTTATGTCTT ACTGTACTGT CTGTCTTGAA AAGTACTTAT TATTTTTGAT 
101 TGGTTCATCA TTTAGTCTAA TTAAAATAAG AGTAGTTTAC ACACCACAAT 
151 TACAGTATTA TAATACTCTG TTTTTCTGTG TGCTTACTAT TACCAGTGAG 
201 TTTTGTACCT TTAGATGATT TCTTCTTGCT CATTAATATC CTTTTTTTTT 
251 TCAGATTGAA AAACTCCCTT TAGCATTTCT TGTGGGATAT AGGTCTGGTG 
301 TTGATGAAAT CTCGCAGCTT TTGTTTGTCT GGGAAGGTCT TTATTTCTCC 
351 TTCCTGTTGG AAGGATATTT TTGCCAGATA CGTTATTCTA GGCTAAAAGT 
401 TTTTTTTCCT TCAGCACTTT AAATATGTCA TGCCACTCCC CCCTGGCCTG 
451 TAAGGTTTCC ACTGGAAAGG TGGCTGCCCC ATGTCATGTA TTGGAGCTCT 
501 ACTGCATGTT ATTTGTTTCT TTTCTCTTGC TGCTTTTAGG ATCCTTTCTT 
551 TATCCTTGAC CTTTCGGAGT TTAATTATCA GATGCCTTGA GGTCGTCTTC 
601 TTTGGGTTAA ATCTGCTTGG TGTTCTATAA ACTTCTTGTA CAAAAAATCA 
651 GCCAGGCATG GTGGTGGGCA CCTGTAATCC CAGCTACTTG GGAGGCTGAG 
701 GCAGGAGAAT CGCTTGAACC CTGGAGGTGG AGGTTGCAGT GAGCCGAGAT 
751 CGCATCATTG CACTCCCACC TGGGCGACAG AGCAAAACTC CGTCTCAAAA 
801 AAAAAATTAT TTGGGCTCGG TGGTGCCTGT AGTCCCAGCT ACTTGGGAGG 
851 CAGGAGGTCC ACTTGATGTT GAGATTGCAG TGAGCCATGA TCCTGCCACT 
901 GCACTCCGGC CCGGGCAACA GAGTGAGACC CTGTCTAAAG AAAAAATAAA 
951 AATAAAAAAG CAACATATCC TAAATAAAGG ATCCTCCATA ATGTTTCCAC 
1001 CAGATTTCTA ATCAGAAACA TGGAGGCCAG G7VAGCAGTGG AGAATGACGA 
1051 CCCTCAGGCA GCCCTGGAGG ATGCTGTCAC AGGCTGGGGC AAGGGCCTTC 
1101 AGGCTACCAA CTGGGAGCTC TGGGAACAGC CCTGTTGCAA ACAGGAAGTC 
1151 ATGGCCCGGC CAGAGCCCAG AATGTGGGCT GAGCTGGGAT CCATGTGACA 
1201 GCTTTGAGGC TCACCGGGAG CAGCCTCTGG ACAGGAGAGG TCCCATCCAG 
1251 GAAACCTCGG GCATGGCTGG GAAGTGGGGT ACTTGGTGCC GGGTCTGTAT 
1301 GTGTGTGTGA CTGGTGTGTG TGAGAGAGAA TGTGTGCCCT GAGTGTCAGT 

0 1351 GTGAGTCTGT GTATGTGTGA ATATTGTCTT TGTGTGGGTG ATTTTCTGCA 
1401 TGTGTAATCG TGTCCCTGCA AGTGTGAACA AGTGGACAAG TGTCTGGGAG 
1451 TGGACAAGAG ATCTGTGCAC CATCAGGTGT GTGCATAGCG TCTGTGCATG 

y 1501 TCAAGAGTGC AAGGTGAAGT GAAGGGACCA GGCCCATGAT GCCACTCATC 

y 1551 ATCAGGAGCT CTAAGGCCCC AGGTAAGTGC CAGTGACAGA TAAGGGTGCT 

^ 1601 GAAGGTCACT CTGGAGTGGG CAGGTGGGGG TAGGGAAAGG GCAAGGTCAT 

'^f 1651 GTTCTGGAGG AGGGGTTGTG ACTACATTAG GGTGTATGAG CCTAGCTGGG 

4 1701 AGGTGGATGG CCGGGTCCAC TGAGACCCTG GTTATCCCAG AAGCCTGTGT 

g 1751 GGGCTTGGGG AGCTTGGAGT GGGGAGAGGG GGTGACTTCT CCGACCAGGC 

1 1801 CTTTCTACCA CCCTACCCTG GGTAAGGGCC TGGAGCAGGA AGCAGCGGCA 
y 1851 AGGACCTCTG GAGCAGCCCA TACCTGCCCT GGCCTGACTC TGCCACTGGC 

1901 AGCACAGTCA ACACAGCAGG TTCACTCACA GCAGAGGGCG AAGGCCATCA 
=1 1951 TCAGCTCCCT TTATAAGGGA AGGGTCACGC GCTCGGTGTG CCGAGAGTGT 

2001 CCTGCCTGGT CCTCTGTGCC TGGTGGGGTG GGGGTGCCAG GTGTGTCCAG 
y 2051 AGGAGCCCAG TTGGTAGTGA GGCAGCCATG GGGCTAGAAG CACTGGTGCC 

2101 CCTGGCCATG ATAGTGGCCA TCTTCCTGCT CCTGGTGGAC CTGATGCACC 
2151 GGCACCAACG CTGGGCTGCA CGCTACCCGC CAGGTCCCCT GCCACTGCCC 
2201 GGGCTGGGCA ACCTTGCTGC ATGTGGACTT CCAGAACACA CCATACTGCT 
2251 TCGACCAGGT GAGGGAGGAG GTCCTGGAGG GCGGCAGAGG TCCTGAGGAT 
2301 GCCCCACCAC CAGCAAACAT GGGTGGTGGG TTAAACCACA GGCTGGATCA 
2351 GAAGCCAGGC TGAGAAGGGG AAGCAGGTTT GGGGGACGTT CCTGGGGAAG 
2401 GACATTTATA CATGGCATGA AGGACTGGAT TTTCCAAAGG CCAAGGAAGA 
2451 GTAGGGCAAG GGCCTGGAGG TGGAGCTGGA CTTGGCAGTG GGCATGCAAG 
2501 CCCATTGGGC AACATATGTT ATGGAGTACA AAGTCCCTTC TGCTGACACC 
2551 AGAAGGAAAG GCCTTGGGAA TGGAAGATGA GTTAGTCCTG AGTGCCGTTT 
2601 AAATCACGAA ATCGAGGATG AAGGGGGTGC AGTGACCCGG TTCAAACCTT 
2651 TTGCACTGTG GGTCCTCGGG CCTCACTGCT CACCGGCATG GACCATCATC 
2701 TGGGAATGGG ATGCTAACTG GGGCCTCTCG GCAATTTTGG TGACTCTTGC 
2751 AAGGTCATAC CTGGGTGACG CATCCAAACT GAGTTCCTCC ATCACAGAAG 
2801 GTGTGACCCC CACCCCTGCC CCACGATCAG GAGGCTGGGT CTCCTCCTTC 
2851 CACCTGCTCA CTCCTGGTAG CCCCGGGGGT CGTCCAAGGT TCAAATAGGA 
2901 CTAGGACCTG TAGTCTGGGG TGATCCTGGC TTGACAAGAG GCCCTGACCC 
2951 TCCCTCTGCA GTTGCGGCGC CGCTTCGGGG ACGTGTTCAG CCTGCAGCTG 
3001 GCCTGGACGC CGGTGGTCGT GCTCAATGGG CTGGCGGCCG TGCGCGAGGC 
3051 GATGGTGACC CGCGGCGAGG ACACGGCCGA CCGCCCGCCT GCGCCCATCT 
3101 ACCAGGTCCT GGGCTTGGGG CCGCGTTCCC AAGGCAAGCG GCGGTGGGGG 
3151 ACAGAGACCG CGTTTCCGTG GGCCCCGGGT GGACAGTGAC CGTAGCCCAA 
3201 GCAGCGCCGA CAGGGCGTGG GGTCCTGGAC GTGAAACAGA GATAAAGGCC 
3251 AGCGAGTGGG CTGAGGACAG TGGGCCAGGA AACCACCTGC ACGGGGGAGG 
3301 TGCGAGTCTG TGGGCTGGGA GGGGGCGGGG CTACTGCCCA GACCCGCCAG 
3351 AAGCCCGGTG GGCGAGGCTG ATGCGTCGAA GTGGCGGTGG CGGGGACCGC 
3401 GCCTATGCTG CGGGCTCAGT GTGGGCGGGA CGGGCGGGAT CTTCCTTGAG 
3451 TGGAAAGGTG GTCAGGGTGG GCAGAGACGA GGTGGGGCCA AACCCCGCCC 
3501 CAGGCAGGGG AGCAATGTGG GTGAGCAAAG AGTGGGCCCT GTGCCCAGCT 



!! :: 
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3551 GGACCGGGCT AGGGACTGCG GGAGACCTTG TGGAGCGCCA GGGTTGGAGT 
3601 GGGTGGCGGA GGGTGGGGCC AAGGCCTTCA TGGCAACGCC CACGTGTCCG 
3651 TCCCGCCCCC AGGGGTGATC CTGTCGCGCT ATGGGCCCGC GTGGCGCGAG 
3701 CAGAGGCGCT TCTCCGTGTC CACCTTGCGC AACTTGGGCC TGGGCAAGAA 
37 51 GTCGCTGGAG CAGTGGGTGA CCGAGGAGGC CGCCTGCCTT TGTGCCGCTT 
3801 CGCCGACCAA GCCGGTGGGT GATGGGCAGA AGGGCACAAA GCGGGAACTG 
3851 GGAAGGCGGG GGACGGAGAA GGCAACCCCT TACCCGCATC TCCCCACCCC 
3901 CAGGACGCCC CTTTCGCCCC AACGGCCTCT TGGACAAAGC CGTGAGCAAC 
3951 GTGATCGCCT CCCTCACCTG CGGGCGCCGC TTCGAGTACG ACGACCCTCG 
4001 CTTCCTCAGG CTGCTGGACC TAGCTCAGGA GGGACTGAAG GAGGAGTCGG 
4051 GCTTTCTGCG CGAGGTGCGG AGCGAGAGAC CGAGGAGTCT CTGCAGGGCG 
4101 AGCTCCTGAG AGGTGCCGGG GCTGGACTGG GGCCTCCGAA GGGCAGGATT 
4151 TGCATAGATG GGTTTGGGAA AGGACATTCC AGGAGACCCC ACTGTAAGAA 
4201 GGGCCTGGAG GAGGAGGGGA CATCTCAGAC ATGGTCGTGG GAGAGGTGTG 
4251 CCCGGGTCAG GGGGCACCAG GAGAGGCCAA GGACTCTGTA CCCCCGTCCA 
4301 CGTTGGAGAT TTCGATTTTA GGTTTCTCCT CTGGGCAAGG AGAGAGGGTG 
4351 GAGGCTGGCA CTTGGGGAGG GACTTGGTGA GGTCAGTGGT AAGGACAGGC 
4 401 AGGCCCTGGG TCTACCTGGA GATGGCTGGG GCCTGAGACT TGTCCAGGTG 
4 451 AACGCAGAGC ACAGGAGGGA TTGAGACCCC GTTCTGTCTG GTGTAGGTGC 
4501 TGAATGCTGT CCCCGTCCTC CTGCACATCC CAGCGCTGGC TGGCAAGGTC 
4551 CTACGCTTCC AAAAGGCTTT CCTGACCCAG CTGGATGAGC TGCTAACTGA 
4 601 GCACAGGATG ACCTGGGACC CAGCCCAGCC ACCCCGAGAC CTGACTGAGG 
4 651 CCTTCCTGGC AAAGAAGGAG AAGGTGAGAG TGGCTGCCAC GGTGGGGGGC 
4701 AAGGGTGGTG GGTTGAACGT CCCAGGAGGA ATGAGGGGAG GCTGGGCAAA 
4751 AGGTTGGACC AGTGCATCAC CCGGCGAGCC GCATCTGGGC TGACAGGTGC 
4801 AGAATTGGAG GTCATTTGGG GGCTACCCCG TTCTATCCCC TGAGTATCCT 
4851 CTCGGCCCTG CTCAGGCCAA GGGGAGCCCT GAGAGCAGCT TCAATGATGA 
O 4901 GAACCTGCGC ATAGTGGTGG GTAACCTGTT CCTTGCCGGG ATGGTGACCA 

4951 CCTCGACCAC GCTGGCCTGG GGCCTCCTGC TCATGATCCT ACACCTGGAT 
:~ 5001 GTGCAGCGTG AGCCCAGCTG GGGCCCAAGG CAGGGACTGA GGGAGGAAGG 

40 5051 GTACAGCTGG GGGCCCCTGG GCTTAGCTGG GACACCCGGG GCTTCCAGCA 

fy 5101 CAGGCGTGGC CAGGCTCCTG TAAGCCTAAC TTCCTCCAAC ACAGGAGGAA 

5151 GGAGAGTGTC CCCTGGGTGC TGACCCATTG TGGGGACGCA TGTCTGTCCA 
^=1^ 5201 GTCCGTGTCC TVACAGGAGAT CGACGACGTG ATAGGGCAGG TGCGGCGACC 

\l 5251 AGAGATGGGT GACCAGGCTC ACATGCCCTA CACCACTGCC GTGATTCACG 

5301 AGGTGCAGCG CTTTGGGGAC ATCATCCCCC TGAGTGTGAC CCATATGACA 
5351 TCCCGTGACA TCGAAGTACA GGGCTTCCGC ATCCCTAAGG TAGGCCTGGC 
Cy 5401 GCCCTCCTCA CCCCAGCTCA GCACCAGCAC CTGGTGATAG CCCCAGCATG 

5451 GCTACTGCCA GGTGGGCCCA CTCTAGGAAC CCTGGCCACC TAGTCCTCAA 
5501 TGCCACCACA CTGACTGTCC CCACTTGGGT GGGGGGTCCA GAGTATAGGC 
5551 AGGGCTGGCC TGTCCATCCA GAGCCCCCGT CTAGTGGGGA GACAAACCAG 
hi 5601 GACCTGCCAG AATGTTGGAG GACCCAGCGC CTGCAGGGAG AGGGGGCAGT 

:V; 5651 GTGGGTGCCT CTGAGAGGTG TGACTGCGCC CTGCTGTGGG GTCGGAGAGG 

5701 GTACTGTGGA GCTTCTCGGG CGCAGGACTA GTTGACAGAG TCCAGCTGTG 
IJ 5751 TGCCAGGCAG TGTGTGTCCC CCGTGTGTTT GGTGGCAGGG GTCCCAGCAT 

f^j 5801 CCTAGAGTCC AGTCCCCACT CTCACCCTGC ATCTCCTGCC CAGGGAACGA 

.^"7 5851 CACTCATCAC CAACCTGTCA TCGGTGCTGA AGGATGAGGC CGTCTGGGAG 

5901 AAGCCCTTCC GCTTCCACCC CGAACACTTC CTGGATGCCC AGGGCCACTT 
5951 TGTGAAGCCG GAGGCCTTCC TGCCTTTCTC AGCAGGTGCC TGTGGGGAGC 
6001 CCGGCTCCCT GTCCCCTTCC GTGGAGTCTT GCAGGGGTAT CACCCAGGAG 
6051 CCAGGCTCAC TGACGCCCCT CCCCTCCCCA CAGGCCGCCG TGCATGCCTC 
6101 GGGGAGCCCC TGGCCCGCAT GGAGCTCTTC CTCTTCTTCA CCTCCCTGCT 
6151 GCAGCACTTC AGCTTCTCCG TGGCCGCCGG ACAGCCCCGG CCCAGCCACT 
6201 CTCGTGTCGT CAGCTTTCTG GTGACCCCAT CCCCCTACGA GCTTTGTGCT 
6251 GTGCCCCGCT AGAATGGGGT ACCTAGTCCC CAGCCTGCTC CCTAGCCAGA 
6301 GGCTCTAATG TACAATAAAG CAATGTGGTA GTTCCAACTT GGGTCCCCTG 
6351 CTCACGCCCT CGTTGGGATC ATCCTCCTCA GGGCAACCCC ACCCCTGCCT 
6401 CATTCCTGCT TACCCCACCG CCTGGCCGCA TTTGAGACGG GTACGTTGAG 
6451 GCTGAGCAGA TGTCAGTTAC CCTTGCCCAT AATCCCATGT CCCCCACTGA 
6501 CCCAACTCTG ACTGCCCAGA TTGGTGACAA GGACTACATT GTCCTGGCAT 
6551 GTGGGGAAGG GGCCAGAATG GGCTGACTAG AGGTGTCAGT CAGCCCTGGA 
6601 TGTGGTGGAG AGGGCAGGAC TCAGCCTGGA GGCCCATATT TCAGGCCTAA 
6651 CTCAGCCCAC CCCACATCAG GGACAGCAGT CCTGCCAGCA CCATCACAAC 
6701 AGTCACCTCC CTTCATATAT GACACCCCAA AATGGAAGAC AAATCATGTC 
6751 AGGGAGCTAT ATGCCAGGGC TACCTCCCAG GGCTCAGTCG GCAGGTGCCA 
6801 GAACATTCCC TGGGAAGGCC CCAGGAAAAC CCAGGACCGA GCCACCGCCC 
6851 TCAGCCTGTC ACCTTGTGTC CAAAATTGGT GGGTTCTTGG TCTCACTGAC 
6901 TTCAAGAATG AAGCCGTGGA CCCTCACGGT GAGTGTTACA GTTCTTAAAG 
6951 ATGGTGTGTT CAGAGTTTGT TCCTTCTGAT GTTAAGACGT GTTCAGAGTT 
7001 TCTTCCTTCT GGTGGGTGCG TGGTCTTGCT GGCTTCAGGA GTGAAGCTGC 
7051 AGACCTTCAC AGTGAGTGTT ACGGCTCTTA AGGCTGCACG TACGGAGTTG 
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7101 TTCATTCTTC CTGGTGGGTT TGTGGTCTCA CTGGCCTCAG GAGTGAAACT 
7151 GCAGTCCTTC CAGTGTTACA ACTCATAAAG GCAGTGTGGA CCCAATGAGG 
7201 GAGCAGCAGC AGCAAGACTT ACTGCAAACA GCAAAAGAAT GATGGCAACC 
7251 AGGTTGCCGC TGCTACTTCA GGCAGCCTGC TTTTATTCCC TTATCTGACC 
7301 CCCACCCACA TCCTGCTGAT TGGCCCATTT TACAGACAGT GGATTGGTCC 
7351 ACTTACAGAG AGCTGATTGG TGCATTTACA ATCCCTGAGC TAGACACAGA 
7401 GTACTGATTG GTATATTTAC AAACCTTGAG CTAGACACAG AGTGCTGAAT 
74 51 GGTGTATTTA CAATCCCTTA GCTAGACATA AAGGTTGTCC CAGTCCCCAC 
7501 TAGATTAGCT AGATAGAGTA GACAGAGAGG ACTGATTGGT GCGTTTACAA 
7551 ACCTTGAGTT AGACACAGGG TGCTGACTGG TGTGTTTACA AACCTTGAGC 
7 601 TAGACACAGA GTGCTGATTG GTGTATTTAC AATCTTTTAG CTAGAAATAA 
7651 AGGTTCCCCA AGTCCCCACC AGATTAGCTA GATAGAGTGC TAATTGGTGC 
7701 ATGCACGAAC CCGGAGCTAG ACACAGAGTG CTGATTGGTG CATATACAAT 
7751 CCTCTGGCTA GACATAAAAG TTCTCCAAGT CCCCACCTGA CTCAGGAGCC 
7801 CAGCCAGCTT CGCCTAGTGG ATCCTATGCC AGGGCCACAG GCAGAGCTGC 
7851 CTGCTAGTCC CACACCGGGC ACCTGTACTC CTCAGCCCTT GGGCAGTGGA 
7901 CGGGACCAGG TGCCGTGGAG CAGTGGGAGG CACCCATCCG GGAGGCTCGG 
7951 GCCTCGCAGG GAGCCCACCG TAGGGAGGCT TGGGCATGGC AGGCTGCAAG 
8001 TCCTGAGCCC TGCCCCGCGG GGAGGTGACT GAGGCCTGGC GACAATTCAA 
8051 GTGTGGTGAG CGCCGGCAGG CCAGCAGTAC TGGGGGACCC GGTGCCCCCT 
8101 CTGCAGCTGC TGGCCCAGGT GCTAAGCCCC TCACTGCCTG GGGCCAGAGG 
8151 CACCAGCCGG CCGCTCCGAG TGCAGGGCCC GCTGAGCCCC TGCCCACCCA 
8201 GAACTGGTGC TGGCCCGCGA GCAACCCAGG TTCCCGCACA CGCCTCTCCC 
8251 TCCATACCTC CCCGCAAGCA GACGGAGCCG GCTCCAGCCT CCACCAGTCC 
8301 AGAGAGGGGC TCCCACAGTG CAGCGCTGGG CTGAACAAGG TCCTACGCTT 
8351 CCAAAAGGCT TTCCTGACCC AGCTGGATGA GCTGCTAACT GAGCACAGGA 
_^ 8401 TGACCTGGGA CCCAGCCCAG CCCCCCCGAG ACCTGACTGA GGCCTTTCCT 

tj 8451 GGCAGAGATG GAGAAGGTGA GAGTGGCTGC CACGGTGGGG GGCAAGGGTG 

8501 GTGGGTTGAG CGTCCCAGGA GGAATGAGGG GAGGCTGGGC AAAAGGTTGG 
'yZ 8551 ACCAGTGCAT CACCCGGCGA GCCGCATCTG GGCTGACAGG TGCAGAATTG 

W 8 601 GAGGTCATTT GGGGGCTACC CCGTTCTGTC CCGAGTATGC TCTCGGCCCT 

fU 8651 GCTCAGGCCA AGGGGAACCC TGAGAGCAGC TTCAATGATG AGAACCTGCG 

8701 CATAGTGGTG GCTGACCTGT TCTCTGCCGG GATGGTGACC ACCTCGACCA 
8751 CGCTGGCCTG GGGCCTCCTG CTCATGATCC TACATCCGGA TGTGCAGCGT 
""'^J 8801 GAGCCCATCT GGGAAACAGT GCAGGGGCCG AGGGAGGAAG GGTACAGGCG 

fn 8851 GGGGCCCATG AACTTTGCTG GGACACCCGG GGCTCCAAGC ACAGGCTTGA 

I'i 8901 CCAGGATCCT GTAAGCCTGA CCTCCTCCAA CATAGGAGGC AAGAAGGAGT 

8951 GTCAGGGCCG GACCCCCTGG GTGCTGACCC ATTGTGGGGA CGCATGTCTG 
s 9001 TCCAGGCCGT GTCCAACAGG AGATCGACGA CGTGATAGGG CAGGTGCGGC 

9051 GACCAGAGAT GGGTGACCAG GCTCACATGC CCTACACCAC TGCCGTGATT 
'-^^ 9101 CATGAGGTGC AGCGCTTTGG GGACATCGTC CCCCTGGGTG TGACCCATAT 

Ly 9151 GACATCCCGT GACATTCGAA GTACAGGGCT TCCGCATCCC TAAGGTAGGC 

?,y 9201 CTGGCGCCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

ZZ 9251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9301 NNNNNNNCCT GCCCAGGGAA CGACACTCAT CACCAACCTG TCATCGGTGC 
Q 9351 TGAAGGATGA GGCCGTCTGG GAGAAGCCCT TCCGCTTCCA CCCCGAACAC 

n 9401 TTCCTGGATG CCCAGGGCCA CTTTGTGAAG CCGGAGGCCT TCCTGCCTTT 

^~ 9451 CTCAGCAGGT GCCTGTGGGG AGCCCGGCTC CCTGTCCCCT TCCGTGGAGT 

9501 CTTGCAGGGG TATCACCCAG GAGCCAGGCT CACTGACGCC CCTCCCCTCC 
9551 CCACAGGCCG CCGTGCATGC CTCGGGGAGC CCCTGGCCCG CATGGAGCTC 
9601 TTCCTCTTCT TCACCTCCCT GCTGCAGCAC TTCAGCTTCT CGGTGCCCAC 
9651 TGGACAGCCC CGGCCCAGCC ACCATGGTGT CTTTGCTTTC CTGGTGAGCC 
9701 CATCCCCCTA TGAGCTTTGT GCTGTGCCCC GCTAGAATGG GGTACCTAGT 
9751 CCCCAGCCTG CTCCCTAGCC AGAGGCTCTA ATGTACAATA AAGCAATGTG 
9801 GTAGTTCCAA CTCGGGTCCC CTGCTCACGC CCTCGTTGGG ATCATCCTCC 
9851 TCAGGGCAAC CCCACCCCTG CCTCATTCCT GCTTACCCCA CCGCCTGGCC 
9901 GCATTTGAGA CAGGGGTACG TTGAGGCTGA GCAGATGTCA GTTACCCTTG 
9951 CCCATAATCC CATGTCCCCC ACTGACCCAA CTCTGACTGC CCAGATTGGT 
10001 GACAAGGACT ACATTGTCCT GGCATGTGGG GAAGGGGCCA GAATGGGCTG 
10051 ACTAGAGGTG TCAGTCAGCC CTGGATGTGG TGGAGAGGGC AGGACTCAGC 
10101 CTGGAGGCCC ATATTTCAGG CCTAACTCAG CCCACCCCAC ATCAGGGACA 
10151 GCAGTCCTGC CAGCACCATC ACAACAGTCA CCTCCCTTCA TATATGACAC 
10201 CCCAAAACGG AAGACAAATC ATGGCGTCAG GGAGCTATAT GCCAGGGCTA 
10251 CCTACCTCCC AGGGCTCAGT CGGCAGGT 
(SEQ ID NO: 3) 
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Intron: 


3134-3903 






Exon: 


3904-4064 






Intron: 


4065-4496 






Exon: 


4497-4673 






Intron: 


4674-4865 






Exon : 


4866-5007 






Intron: 


5008-5201 






Exon : 


5202-5389 






Intron: 


5390-5843 






Exon: 


5844-5985 






Intron : 


5986-9556 






Exon : 


9557-9732 






Stop 


9733 






SNPs: 








DNA 








Position 


Major 


Minor 


Domain 


3101 


C 


T A 


Exon 


3439 


A 


G 


Intron 


4908 


C 


T 


Exon 


5627 


G 


A 


Intron 


6733 


T 


C 


Intron 


7788 




C T 


Intron 


7867 


G 


A 


Intron 


7948 


C 


T 


Intron 



Protein 

Position Major 



107 
245 



Minor 
T T 
L 



5 c 



Context: 
DNA 

Position 
3101 



3439 



4908 



GTGTGACCCCCACCCCTGCCCCACGATCAGGAGGCTGGGTCTCCTCCTTCCACCTGCTCA 
CTCCTGGTAGCCCCGGGGGTCGTCCAAGGTTCAAATAGGACTAGGACCTGTAGTCTGGGG 
TGATCCTGGCTTGACAAGAGGCCCTGACCCTCCCTCTGCAGTTGCGGCGCCGCTTCGGGG 
ACGTGTTCAGCCTGCAGCTGGCCTGGACGCCGGTGGTCGTGCTCAATGGGCTGGCGGCCG 
TGCGCGAGGCGATGGTGACCCGCGGCGAGGACACGGCCGACCGCCCGCCTGCGCCCATCT 
[C,T,A] 

CCAGGTCCTGGGCTTCGGGCCGCGTTCCCAAGGCAAGCGGCGGTGGGGGACAGAGACCGC 
GTTTCCGTGGGCCCCGGGTGGACAGTGACCGTAGCCCAAGCAGCGCCGACAGGGCGTGGG 
GTCCTGGACGTGAAACAGAGATAAAGGCCAGCGAGTGGGCTGAGGACAGTGGGCCAGGAA 
ACCACCTGCACGGGGGAGGTGCGAGTCTGTGGGCTGGGAGGGGGCGGGGCTACTGCCCAG 
ACCCGCCAGAAGCCCGGTGGGCGAGGCTGATGCGTCGAAGTGGCGGTGGCGGGGACCGCG 

CGGCGGTGGGGGACAGAGACCGCGTTTCCGTGGGCCCCGGGTGGACAGTGACCGTAGCCC 
AAGCAGCGCCGACAGGGCGTGGGGTCCTGGACGTGAAACAGAGATAAAGGCCAGCGAGTG 
GGCTGAGGACAGTGGGCCAGGAAACCACCTGCACGGGGGAGGTGCGAGTCTGTGGGCTGG 
GAGGGGGCGGGGCTACTGCCCAGACCCGCCAGAAGCCCGGTGGGCGAGGCTGATGCGTCG 
AAGTGGCGGTGGCGGGGACCGCGCCTATGCTGCGGGCTCAGTGTGGGCGGGACGGGCGGG 
[A,G] 

TCTTCCTTGAGTGGAAAGGTGGTCAGGGTGGGCAGAGACGAGGTGGGGCCAAACCCCGCC 
CCAGGCAGGGGAGCAATGTGGGTGAGCAAAGAGTGGGCCCTGTGCCCAGCTGGACCGGGC 
TAGGGACTGCGGGAGACCTTGTGGAGCGCCAGGGTTGGAGTGGGTGGCGGAGGGTGGGGC 
CAAGGCCTTCATGGCAACGCCCACGTGTCCGTCCCGCCCCCAGGGGTGATCCTGTCGCGC 
TATGGGCCCGCGTGGCGCGAGCAGAGGCGCTTCTCCGTGTCCACCTTGCGCAACTTGGGC 

ATGACCTGGGACCCAGCCCAGCCACCCCGAGACCTGACTGAGGCCTTCCTGGCAAAGAAG 
GAGAAGGTGAGAGTGGCTGCCACGGTGGGGGGCAAGGGTGGTGGGTTGAACGTCCCAGGA 
GGAATGAGGGGAGGCTGGGCAAAAGGTTGGACCAGTGCATCACCCGGCGAGCCGCATCTG 
GGCTGACAGGTGCAGAATTGGAGGTCATTTGGGGGCTACCCCGTTCTATCCCCTGAGTAT 
CCTCTCGGCCCTGCTCAGGCCAAGGGGAGCCCTGAGAGCAGCTTCAATGATGAGAACCTG 
EC,T] 

GCATAGTGGTGGGTAACCTGTTCCTTGCCGGGATGGTGACCACCTCGACCACGCTGGCCT 
GGGGCCTCCTGCTCATGATCCTACACCTGGATGTGCAGCGTGAGCCCAGCTGGGGCCCAA 
GGCAGGGACTGAGGGAGGAAGGGTACAGCTGGGGGCCCCTGGGCTTAGCTGGGACACCCG 
GGGCTTCCAGCACAGGCGTGGCCAGGCTCCTGTAAGCCTAACTTCCTCCAACACAGGAGG 
AAGGAGAGTGTCCCCTGGGTGCTGACCCATTGTGGGGACGCATGTCTGTCCAGTCCGTGT 



5627 



CCCCTGAGTGTGACCCATATGACATCCCGTGACATCGAAGTACAGGGCTTCCGCATCCCT 
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f 

AAGGTAGGCCTGGCGCCCTCCTCACCCCAGCTCAGCACCAGCACCTGGTGATAGCCCCAG 
CATGGCTACTGCCAGGTGGGCCCACTCTAGGAACCCTGGCCACCTAGTCCTCAATGCCAC 
CACACTGACTGTCCCCACTTGGGTGGGGGGTCCAGAGTATAGGCAGGGCTGGCCTGTCCA 
TCCAGAGCCCCCGTCTAGTGGGGAGACAAACCAGGACCTGCCAGAATGTTGGAGGACCCA 
[G,A] 

CGCCTGCAGGGAGAGGGGGCAGTGTGGGTGCCTCTGAGAGGTGTGACTGCGCCCTGCTGT 
GGGGTCGGAGAGGGTACTGTGGAGCTTCTCGGGCGCAGGACTAGTTGACAGAGTCCAGCT 
GTGTGCCAGGCAGTGTGTGTCCCCCGTGTGTTTGGTGGCAGGGGTCCCAGCATCCTAGAG 
TCCAGTCCCCACTCTCACCCTGCATCTCCTGCCCAGGGAACGACACTCATCACCAACCTG 
TCATCGGTGCTGAAGGATGAGGCCGTCTGGGAGAAGCCCTTCCGCTTCCACCCCGAACAC 

6733 TGAGACGGGTACGTTGAGGCTGAGCAGATGTCAGTTACCCTTGCCCATAATCCCATGTCC 
CCCACTGACCCAACTCTGACTGCCCAGATTGGTGACAAGGACTACATTGTCCTGGCATGT 
GGGGAAGGGGCCAGAATGGGCTGACTAGAGGTGTCAGTCAGCCCTGGATGTGGTGGAGAG 
GGCAGGACTCAGCCTGGAGGCCCATATTTCAGGCCTAACTCAGCCCACCCCACATCAGGG 
ACAGCAGTCCTGCCAGCACCATCACAACAGTCACCTCCCTTCATATATGACACCCCAAAA 
(T,C] 

GGAAGACAAATCATGTCAGGGAGCTATATGCCAGGGCTACCTCCCAGGGCTCAGTCGGCA 
GGTGCCAGAACATTCCCTGGGAAGGCCCCAGGAAAACCCAGGACCGAGCCACCGCCCTCA 
GCCTGTCACCTTGTGTCCAAAATTGGTGGGTTCTTGGTCTCACTGACTTCAAGAATGAAG 
CCGTGGACCCTCACGGTGAGTGTTACAGTTCTTAAAGATGGTGTGTTCAGAGTTTGTTCC 
TTCTGATGTTAAGACGTGTTCAGAGTTTCTTCCTTCTGGTGGGTGCGTGGTCTTGCTGGC 

TCCCAGTCCCCACTAGATTAGCTAGATAGAGTAGACAGAGAGCACTGATTGGTGCGTTTA 
CAAACCTTGAGTTAGACACAGGGTGCTGACTGGTGTGTTTACAAACCTTGAGCTAGACAC 
AGAGTGCTGATTGGTGTATTTACAATCTTTTAGCTAGAAATAAAGGTTCCCCAAGTCCCC 
ACCAGATTAGCTAGATAGAGTGCTAATTGGTGCATGCACGAACCCGGAGCTAGACACAGA 
GTGCTGATTGGTGCATATACAATCCTCTGGCTAGACATAAAAGTTCTCCAAGTCCCCACC 

GACTCAGGAGCCCAGCCAGCTTCGCCTAGTGGATCCTATGCCAGGGCCACAGGCAGAGCT 
GCCTGCTAGTCCCACACCGGGCACCTGTACTCCTCAGCCCTTGGGCAGTGGACGGGACCA 
GGTGCCGTGGAGCAGTGGGAGGCACCCATCCGGGAGGCTCGGGCCTCGCAGGGAGCCCAC 
CGTAGGGAGGCTTGGGCATGGCAGGCTGCAAGTCCTGAGCCCTGCCCCGCGGGGAGGTGA 
CTGAGGCCTGGCGACAATTCAAGTGTGGTGAGCGCCGGCAGGCCAGCAGTACTGGGGGAC 

AGGGTGCTGACTGGTGTGTTTACAAACCTTGAGCTAGACACAGAGTGCTGATTGGTGTAT 
TTACAATCTTTTAGCTAGAAATAAAGGTTCCCCAAGTCCCCACCAGATTAGCTAGATAGA 
GTGCTAATTGGTGCATGCACGAACCCGGAGCTAGACACAGAGTGCTGATTGGTGCATATA 
CAATCCTCTGGCTAGACATAAAAGTTCTCCAAGTCCCCACCTGACTCAGGAGCCCAGCCA 
GCTTCGCCTAGTGGATCCTATGCCAGGGCCACAGGCAGAGCTGCCTGCTAGTCCCACACC 
[G,A] 

GGCACCTGTACTCCTCAGCCCTTGGGCAGTGGACGGGACCAGGTGCCGTGGAGCAGTGGG 
AGGCACCCATCCGGGAGGCTCGGGCCTCGCAGGGAGCCCACCGTAGGGAGGCTTGGGCAT 
GGCAGGCTGCAAGTCCTGAGCCCTGCCCCGCGGGGAGGTGACTGAGGCCTGGCGACAATT 
CAAGTGTGGTGAGCGCCGGCAGGCCAGCAGTACTGGGGGACCCGGTGCCCCCTCTGCAGC 
TGCTGGCCCAGGTGCTAAGCCCCTCACTGCCTGGGGCCAGAGGCACCAGCCGGCCGCTCC 

TAAAGGTTCCCCAAGTCCCCACCAGATTAGCTAGATAGAGTGCTAATTGGTGCATGCACG 
AACCCGGAGCTAGACACAGAGTGCTGATTGGTGCATATACAATCCTCTGGCTAGACATAA 
AAGTTCTCCAAGTCCCCACCTGACTCAGGAGCCCAGCCAGCTTCGCCTAGTGGATCCTAT 
GCCAGGGCCACAGGCAGAGCTGCCTGCTAGTCCCACACCGGGCACCTGTACTCCTCAGCC 
CTTGGGCAGTGGACGGGACCAGGTGCCGTGGAGCAGTGGGAGGCACCCATCCGGGAGGCT 
[C,T] 

GGGCCTCGCAGGGAGCCCACCGTAGGGAGGCTTGGGCATGGCAGGCTGCAAGTCCTGAGC 
CCTGCCCCGCGGGGAGGTGACTGAGGCCTGGCGACAATTCAAGTGTGGTGAGCGCCGGCA 
GGCCAGCAGTACTGGGGGACCCGGTGCCCCCTCTGCAGCTGCTGGCCCAGGTGCTAAGCC 
CCTCACTGCCTGGGGCCAGAGGCACCAGCCGGCCGCTCCGAGTGCAGGGCCCGCTGAGCC 
CCTGCCCACCCAGAACTGGTGCTGGCCCGCGAGCAACCCAGGTTCCCGCACACGCCTCTC 

Chromosome mapping: 

Chromosome #22 



7788 




7867 



7948 
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